Headset computer that uses motion and voice commands to control information display and remote devices

ABSTRACT

A wireless hands-free portable headset computer with a micro display arranged near but below a wearer&#39;s eye in a peripheral vision area not blocking the wearer&#39;s main line of sight. The headset computer can display an image or portions of an image, wherein the portions can be enlarged. The headset computer also can be equipped with peripheral devices, such as light sources and cameras that can emit and detect, respectively, visible light and invisible radiation, such as infrared radiation and ultraviolet radiation. The peripheral devices are controllable by the wearer by voice command or by gesture. The headset computer also can be broken down into component parts that are attachable to another article worn by an individual, such as a helmet or respirator mask.

RELATED APPLICATIONS

This application is a continuation of U.S. application Ser. No. 13/468,207, entitled “Headset Computer That Uses Motion And Voice Commands To Control Information Display And Remote Devices,” filed on May 10, 2012, which claims the benefit of U.S. Provisional Patent Application No. 61/484,464, entitled “Hands-Free Enhanced Digital Eyewear,” filed on May 10, 2011. The entire teachings of the above applications are incorporated herein by reference.

BACKGROUND

The present application relates to human/computer interfaces and more particularly to a wearable headset computer that accepts voice commands, tracks hand gestures and/or detects head movements to provide inputs to control software running within the headset computer and/or peripheral devices.

Small, portable electronic devices capable of storing and displaying large amounts of high resolution computer graphic information and video content continue to be increasingly popular. Devices such as the Apple iPhone™, Google Android™ and other smartphones represent a significant trend in convergence among mobile telephones, portable computers and digital media players (iPhone is a trademark of Apple Computer, Inc. and Android is a trademark of Google, Inc.). Although these smartphones typically include a display screen, the visual experience of a high-resolution, large format display cannot easily be replicated of because physical size limitations in the handheld form factor.

Other devices which provide improved functionality over smart phones are known by various names such as headset computers, video eyewear, head mounted displays with embedded computer processors, and the like. These devices include a frame or other support mechanism that is worn about the face and/or head, similar to a pair of eyeglasses and/or headphones. The frame houses a small, high-resolution microdisplay, optical lenses and other components needed to present an electronic image to the wearer. Circuitry in the headset computer can include display drivers, wireless interface(s), and fully functional personal computer systems. For further information regarding such headset computers, refer to corresponding patent application entitled “Handheld Wireless Display Devices Having High-Resolution Display Suitable for Use as a Mobile Internet Device”, PCT International Application Number PCT/US09/38601 filed Mar. 27, 2009, the entire contents of which is hereby incorporated by reference.

Such devices are also further described in U.S. Application Nos. 61/300,611, filed on Feb. 2, 2010, titled “Head Mounted Video Eyewear With Accessory Mount;” U.S. Ser. No. 12/774,179, filed on May 5, 2010, titled “Remote Control Of Host Application Using Motion And Voice Commands;” 61/176,662, filed on May 8, 2009, titled “Remote Control Of Host Application Using Tracking And Voice Commands;” 61/237,884, filed on Aug. 28, 2009, titled “Remote Control Of Host Application Using Motion And Voice Commands;” U.S. Ser. No. 12/008,114, filed on Jan. 8, 2008, titled “Monocular Display Device;” and U.S. Ser. No. 12/008,104, filed on Jan. 8, 2008, titled “Monocular Display Device;” the contents of each of which are incorporated by reference in their entirety.

SUMMARY

A headset computer includes a microdisplay, multiple input devices such as a head tracking accelerometer and/or camera to detect movements such as head movements, hand motions and/or gestures, and audio processing circuits to detect voice commands. These inputs provide control over and operation of an application program running within the headset computer itself and/or peripherals associated with the headset computer.

In one implementation, voice, head motion, and/or hand gesture inputs are received from sensors located within the headset computer. A field of view into a 3-D virtual space is then determined from the voice, head motion and/or hand gesture inputs. Data representing the 3-D virtual space may be maintained either by a processor local to the headset computer device and/or a remote processor. The 3-D virtual space contains data representing one or more graphical objects. The graphical objects may include various elements such as computer desktops, application windows, digital images, photographs, 3-D models or other image data. The voice, head motion, and/or hand gesture inputs may determine both the field of view and scale factor to determine a viewpoint into the 3-D virtual space. Image data selected from the 3-D virtual space determined based on the field of view and scale factor, and then presented on the microdisplay.

The hand gesture, head motion and/or voice commands can be used not only to set the field of view and scale factor but also to select which of several graphic objects within the 3-D virtual space are selected for presentation on the microdisplay, in whole or in part. Thus by using these input commands the wearer of the headset may navigate through a large format 3-D space and completely control which portions of the 3-D space are seen on the microdisplay.

The head set computer may also include a light source and camera. The spectral properties of the light source and camera can be manipulated by the input commands to provide a synthetic vision function. In particular, the scene illuminated by the light source is detected by the camera, and in turn presented on the microdisplay. The light source and camera preferably operate in invisible electromagnetic portion of the spectrum including infrared, near infrared, ultraviolet, shortwave infrared or other invisible wavelengths. In this manner, the wearer of the headset computer has the ability to view scenes in an invisible portion of the spectrum.

The voice, head motion and/or hand gestures can control an operating wavelength for the light source and the camera, intensity of emissions from the light source, sensitivity of the camera, or other aspects of presentation of the synthetic vision function on the micro display.

The light source also can be used to determine range information. In particular, the light source may emit a high precision light, such as a laser light. A reflection of the light by an object can then be detected by a camera or other sensor capable of determining a round-trip time delay for the light. The headset computer can then determine a range to one or more points in physical space. This can provide not only a range to a given object but also, for example, a distance between two objects. In a case where the distances to more than two points are determined, a volume of space can be estimated.

The headset computer can be packaged to fit headgear such as a helmet. In one implementation, the processors and other electronic components can be disposed in afirst housing, a second housing may carry the microdisplay on a boom and a third housing may include a power supply. One or more signal and/or power connectors are then provided between the various housings. The housings are separately attachable and detachable from the headgear. This permits retrofitting of a headset computer to helmets or other headgear that safety, security, and military personnel are accustomed to wearing. This then eliminates the need for them to fit completely assembled headsets inside of or on top of their headgear. In certain embodiments the housing for the micro display and/or processor may include noise cancellation circuits that may assist with operations in a noisy environment such as with a rebreather apparatus.

The headset computer may also control a remote vehicle and receive and display images returned from a camera on the remote vehicle. Control commands derived from voice, head motion, and hand gesture inputs can be used as a remote control for an associated vehicle. In other embodiments, the control command can alter position and/or other operating characteristics of a camera located on the vehicle. The image information returned from the camera on the remote vehicle may be displayed on the micro display.

A wearer of the headset computer can therefore experience using the remote vehicle as if he were a miniature pilot traveling on the vehicle, operating the vehicle while having a view of the scene around the vehicle displayed on the micro display. A wireless interface between the headset computer and the vehicle can be used to detect control commands for the vehicle.

The control commands may also control apparatus that are mounted on the vehicle such as payloads and other devices to be delivered by vehicle. In certain aspects, the voice command can generate complex commands for the vehicle such as “return to base”, “circle at a specific altitude”, and so forth.

BRIEF DESCRIPTION OF THE DRAWINGS

The foregoing will be apparent from the following more particular description of example embodiments of the disclosure, as illustrated in the accompanying drawings in which like reference characters refer to the same parts throughout the different views. The drawings are not necessarily to scale, emphasis instead being placed upon illustrating various embodiments.

FIG. 1 shows a headset computer that implements several functions described herein.

FIG. 2 depicts a viewport into a 3-D virtual space that may be manipulated with voice, head motions, or hand gestures.

FIGS. 3A and 3B depict a synthetic enhanced vision mode and range finding mode.

FIGS. 4A and 4B illustrate the synthetic vision mode in more detail, with FIG. 4A being a black screen seen with the light source off, and FIG. 4B a scene that is in view when the light source is enabled.

FIGS. 5A and 5B illustrate an implementation where components of the headset computer are individually attached to a helmet.

FIG. 6 illustrates a wireless joystick and mouse controller that can be used with the headset computer to control another device such as a vehicle.

FIGS. 7A, 7B and 7C illustrate more examples of vehicles that can be controlled by the headset computer operating components contained thereon.

FIG. 8 shows a wireless joystick used with the headset computer.

FIG. 9 is a more detailed view of the headset computer.

FIG. 10 is a high-level block diagram of the functions performed by the headset computer.

FIG. 11 is a more detailed block diagram of the components of the headset computer.

DETAILED DESCRIPTION

FIG. 1 illustrates a headset computer 100 that is worn on the head of a person. The headset computer 100 is also known as video eyewear, head mounted display (HMD) with embedded computer, and by various other names. In this example, the headset computer 100 consists of apparatus, processor(s), and software that control the presentation of objects on microdisplay 120 and peripheral devices. The microdisplay 120 may be enclosed in a display housing 125 supported by a boom 132 that attaches to a frame 140. The frame 140 may additionally contain a housing 150 to enclose further electronic components; in the example shown here the housing 150 is positioned on the back of the wearer's head. One or more speakers 135 deliver audio signals to the wearer's ears and similarly one or more microphones 137 either contained in the housing 125 and/or in other areas detect audio signals. A peripheral such as a camera 160 may be attached to a peripheral port 165. The headset computer 100 may be further supported, for example, by a strap 170 on the wearer's head.

As will be explained in detail below in connection with FIGS. 9, 10 and 11, the headset computer 100 is a completely contained personal computer system including one or more data processor(s) for generating images on the microdisplay 120 and performing other functions. The processor(s) located in the headset computer 100 are capable of interpreting voice commands, detecting hand movements of the wearer (such as through the camera) 160 and/or detecting the wearer's head movements through accelerometers or other motion sensors. These inputs are then interpreted as commands to either the headset computer 100 and/or remote devices with which the headset computer can communicate, such as over wireless interfaces.

Viewport Into 3-D Virtual Space

One function performed by the headset computer 100 is to provide a graphical viewport and/or window into a 3-D virtual space. The graphical viewpoint determines which information is presented on the microdisplay 120. In this mode, for example, a movement by the wearer's head can bring a different section of that 3-D virtual space into view on the microdisplay 120.

FIG. 2 illustrates this in some detail. Here the wearer 200 is shown relative to the virtual 3-D space 300. The virtual 3-D space 300 has a number of windows 310, 320, 330, 340, 350 located within it and maintained by the processor(s) within the headset computer 100 or the external host. However, only a single one of the windows 340 is shown without shading, to indicate that it is an active window. The other windows 310, 320, 330 and 350 are shown grayed out. Thus while these other windows exist in the 3-D virtual space, the wearer 200 does not see the entire 3-D virtual space—rather the user 200 sees only the window 340 (or more typically, a selected portion thereof) on the microdisplay 120. The window 340 may be simply a viewport into for example, a graphic window in a windowed operating system. In the example shown here, the graphical object is a digital map image and the wearer is viewing only a portion of the map and not the whole map.

It will be understood that the 3-D virtual space may include various elements such as computer desktops, application windows, photographs, 3-D object models or any other type of digital image objects. It should be further understood that these image objects can be positioned next to, overlaid on or behind or beside one another in the 3-D virtual space.

The user can manipulate the various image objects by giving commands using the headset computer 100. In one example, the user can ask for a level of enlargement of a particular area of interest within one of the objects. The location and size of the window area may be selected by the tracking of head motions, voice commands and/or hand gestures. For example, the user may specify a position and magnification and/or zoom level to be applied to a particular application software window. The result is similar to using a magnifying glass to look at something seamlessly over a large area, but by using the head tracker/gesture detector/voice input detection to zoom into an area being seen on the microdisplay 120 and at what level of magnification. Thus, using this feature the user can move his head left, right, up or down and then select a particular one of the image objects 300, 310, 320 through 340 to be active. In one example, the user 200 might from the position shown in FIG. 2, turn his head to the right. This motion would then cause a new window 330 to then become the active window, with window 340 then becoming deactivated.

The user 200 can also issue commands to retain a piece of a large image that he wishes to magnify, freezing that portion on the screen and setting it aside and then going back and looking at another area of that image or even requesting another level of magnification for that other area. In this way, the user can view the same portions of an image at different levels of magnification and/or view different bits or pieces of a larger image at different levels of magnification and then switch between them by merely moving his head left or right, up or down

In yet another example, the wearer may issue voice commands to manipulate the position of the various image objects in the 3-D virtual space. For example, he may select an image object such as by moving his head, but then issue a voice command such as to “move object up” or “move object A behind object B”. This causes the head tracker to then control the relative position of the selected image object(s) within the 3-D virtual space, rather than allowing him to navigate among a given single object within the 3-D space.

It will be understood that the wearer 200 thus has access to a virtual desktop that is in any form factor that can be represented in a 3-D virtual space, i.e. he may be working in a 360° surface that wraps around his head or may be given the impression that he is working in a 3-D space with a long depth of field.

In another example, the user 200 may turn his head to the lower left causing the window 350 to become active. This window may be a 3-D model of an object such as an engine. The user may then proceed to manipulate this 3-D model using voice, head tracking and/or hand gesture commands to manipulate the viewpoint in 3-D space. The wearer may also issue a command to manipulate the model itself, such as to say, “rotate object 90° horizontal” causing the representation of the motor to rotate in 3-D space.

The view of the displayed image on the microdisplay 120 does not require the user to be physically oriented as if he were looking in any particular direction. For example, the user may remotely view any image being virtually generated in a sitting or standing position as might be projected on a wall in a room, but yet that wearer may be himself physically oriented in other positions such as laying down.

Hands-Free Synthetic Vision

FIGS. 3A and 3B illustrate other modes and functions provided by the headset computer 100. In this example, an emitter such as a light source 195 is located within the headset computer 100 typically within the same small housing as the camera 160. If the wavelength of the camera 1600 and light source 195 are coordinated such that the camera is sensitive at the same wavelength emitted by the light source(s) then a synthetic vision function can result. For example, the light source may be infrared or ultraviolet, and if the camera is similarly sensitive in these wavelength regions, the resulting image from the camera can be seen on to the microdisplay 120. The processor in headset computer 100 can also cause not just the image to be captured by the camera and directly viewed on the microdisplay 120, but to also be stored in memory and/or be sent to a remote system or display.

Using the headset computer 100, the wearer can thus experience hands-free synthetic vision that combines a synthetic view that is, for example, a far infrared view showing heat signatures of individuals or objects on the other side of a wall or other obstruction. An example of the same is shown in FIGS. 4A and 4B. With the light source 195 off, the image on the microdisplay is completely blank. However, with the light source 195 on, the infrared camera picks up the emission of the infrared and see an image that would not otherwise be visible. Using the headset computer 100, with an integrated infrared light source and camera, the wearer is then able to see what is in the dark environment on the microdisplay 120, but himself remain unseen by those relying only on natural non-enhanced sight. The individual 350 in the scene would not be able to detect the wearer's presence, because the infrared illumination from the light source would not be visible to the unaided eye.

As shown in FIG. 3B, the light source may also include a laser range finder 198. The laser 198 can be aimed by the user either by moving his head and/or by using voice commands. The range finder can be used for various functions such as finding the distance to an object in physical space, determining relative distance between two things. In the latter example, the user may aim his head at one object and measure a distance to that first object, and then move his head to otherwise aim the laser at second object. The user can then ask the headset computer 100 to solve the triangulation equation between his position and the two objects, thereby estimating a distance between the two objects.

In a further example, a volume of space can be estimated by the wearer aiming the laser at three or more points and asking the headset computer to figure out the distances between them. These functions can be useful in uses such as surveying or material estimating necessary. This can now be accomplished without the wearer actually moving about or by using measuring implements other than the laser range finder as built into the headset computer 100.

Components Retrofittable to Helmet

FIGS. 5A and 5B illustrate another example of the headset computer 100 packaged in a particular way to be retrofit onto existing headgear. In this implementation, headset computer electronics (including peripheral devices such as a camera and battery power source), the main processor and so forth may be packaged into one component housing 502, and the boom with an integrated optical/visual/audio part packaged as another component housing 501. The components 501, 502 are individually attachable to an existing headgear familiar to the user. In this example shown, a helmet 500 may have two Velcro™ pads, with the electronics component 502 attached to one pad Velcro™, and a battery 503 separately packaged and attached to the other Velcro™ pad (Velcro is a trademark of the Velcro Corporation). The boom element 501 is also attached to the helmet, such as via a mechanical clip or fastener; the boom of course including the integrated microdisplay and microphones. A cabling system can connect the microdisplay and boom 501 to the electronics 502 and battery 503. The boom 501 may be fixed and/or bolted to the left side or right side of the helmet, depending on user preference and also depending upon other equipment that the user may be operating.

Camera(s), laser(s), and other peripherals can also be mounted to the helmet 500. Instead of requiring the wearer to wear a dedicated headset under the helmet, this packaging approach can implement a headset computer functionality without the user having to become comfortable with new headgear. In addition, operation with certain types of headgear (such as a rebreather) is not affected. This particular end use may be improved if the on board electronics also provide for noise cancellation. For example, if the wearer is using a rebreather, the rebreather tends to make a lot of background noise that would otherwise interfere with voice inputs or sound recording. The on-board electronics may include noise cancellation circuits or programming that eliminate the background noise of the rebreather. A similar approach can be used to cancel out other background noises to allow for clearer recording of voices or other sounds.

Headset Computer Controls Remote Vehicle, Receives and Displays Images From and To the Remote Vehicle

In yet another implementation, the voice, head motion and/or hand gesture inputs received from the sensors located within the headset computer 100 can be used to derive a remote control command. That control command can then be sent over a wireless interface to control a remote vehicle robot, or other object. In this end use, the input device may also further include a wireless joystick and/or mouse to provide further inputs to control the vehicle.

In one example, a voice input to the headset computer can generate a control command to control the path of the vehicle. Voice commands, such as “turn right”, “turn left”, “move forward”, “move backward”, “stop” and so forth can be included in the processing capabilities of the headset computer 100. Similarly, head tracking inputs can generate a control command to control the path of the vehicle, or more commonly the direction of the camera on the vehicle. In this way, the user can obtain an experience that he is physically located on the vehicle. This is accomplished by having the camera on the vehicle transmitting video preferably wirelessly back to the headset computer. The video received at the remote vehicle can then be displayed on the display within the headset computer.

In yet another example, a wireless handheld controller 610 such as that shown in FIG. 6 can be used with the headset computer 100 to control the path position, attitude and/or direction of the vehicle 620 more naturally.

Using this arrangement, a person can control a vehicle such as an unmanned aerial vehicle (FIG. 7A), unmanned ground vehicle (FIG. 7B) or a toy (FIG. 7C) and so forth. This eliminates problems with prior art that simply operates a remote vehicle 620 with a videogame type of controller that requires almost total user attention and both hands to operate. By utilizing wireless joystick controller 610 in combination with the wearable headset computer 100 that can obtain head motion, voice and hand tracking commands, the control and electronic processing capabilities of the headset computer can give anyone control such as a soldier, policeman, fire or industrial worker control over one or more remote systems or vehicles 620 in simple and natural way.

In the absence of a separate user input device, the camera on the headset computer 100 may detect the user's hand gestures as control inputs. The wearer can also give speech commands to give the vehicle certain commands. For example, if the wearer says “freeze”, that can be detected by the headset computer which then translates the spoken command into one or more commands to control the flight path of the unmanned aerial vehicle, to stop doing everything else and simply hover or follow a circular flight path around a current point of interest.

In other examples a voice command such as “return to base” can cause the vehicle to follow a complex programmed flight path. Another example can be “circle at a specific altitude” which can cause the vehicle to generally follow a geo-stable circle around its present location. This can alleviate the user from tediously having to continuously provide commands via the handheld controller.

Other voice commands and hand held commands can be used to control other aspects of the vehicle's capabilities, performance and/or path of travel.

In one embodiment, the vehicle 620 may itself contain a camera that transmits its video output wirelessly back to the headset computer 100. Video carried back to the headset computer 100 is then displayed on the microdisplay 120. The wearer's head movements and/or gestures may then be used in a natural way to control the position, attitude, pan, zoom, magnification, light spectral sensitivities or other capabilities of the camera on the remote vehicle. The user's head movements can then be tracked by the on board electronics of the headset computer 100 and translated by the headset computer into commands that are sent back to aim the camera of the unmanned vehicle. As an example, if the wearer looks to the left, that motion is detected by the head tracker in the headset computer, translated into a camera “move left” command. That “move left” command is then sent wirelessly to the remote vehicle, causing the camera on the remote vehicle to pan to the left.

By returning the video stream back from the vehicle and displaying it on the microdisplay gives the wearer a visual experience as if he were, for example, a miniature pilot inside an unmanned aerial vehicle.

In yet another function, the user can, for example, use speech commands to control other peripherals that the vehicle itself might contain. An unmanned aerial vehicle such as shown in FIG. 8, may carry a payload such as a camera or other sensor to be dropped at a remote location. These payloads, weapons or other objects that the vehicle is capable of delivering can be controlled by the user of the headset computer 100. Control over these payloads can be implemented regardless of what the vehicle itself is being commanded to do.

System Description

FIG. 9 shows a wireless headset computer 100 (also referred to as a video eyewear device 100) that incorporates a high resolution (VGA or better) microdisplay element and other features described below. Headset computer 100 typically includes many different types of integrated circuits including a microprocessor (single or multi-core), one or more wireless interfaces, associated memory or other storage devices, one or more cameras (optical sensors) and/or various sensors. These sensors may include audio input and/or output devices, such as one or more microphone(s) input and output speaker(s) the sensors may include geo-positional sensing, 3 axis to 9 axis degrees of freedom orientational sensors (such as a digital magnetometer), atmospheric sensors, health condition sensors, GPS, digital compass, pressure sensors, environmental sensors, energy sensors, acceleration, position, attitude, motion, velocity or optical sensors, and cameras (visible, infrared, etc.). Further circuits such as additional wireless radios, auxiliary lighting, range finders, or the like, and/or an array of sensors may be embedded in and/or attached to the device. Also typically located within the device 100 are a peripheral mount or mounts such as a “hot shoe” (not shown in FIG. 9) for mounting optional accessories such as cameras or additional sensors. The camera(s), motion sensor(s) and/or sensor(s) are used to track the motion and/or position of the user's head, hands and/or body in at least a first axis 111 (horizontal), but preferably also a second (vertical), a third (depth), a fourth (pitch), a fifth (roll) and a sixth (yaw).

The headset computer device 100 can be used in various ways. It can be used as a completely contained, head-mounted fully functional portable personal computer/smart phone with full connectivity to external computers and networks through a short and/or long-range wireless links such as Bluetooth, WiFi, cellular, LTE, WiMax or other wireless radios.

Device 100 can be also used as a remote display for a streaming video signal provided by a remote host computer. The host may be, for example, a laptop, cell phone, Blackberry, iPhone™, or other computing device having lesser or greater computational complexity than the device 100 itself. The host then provides information to the device 100 to be displayed. The device 100 and host are connected via one or more suitable wireless connections such as provided by the Bluetooth WiFi, cellular, LTE, WiMax or other wireless radio link. The host may itself be further connected to other networks such as through a wired or wireless connection to the Internet.

While what is shown in FIG. 9 is a monocular microdisplay presenting a single fixed display element supported on the face of the user with a cantilevered boom, it should be understood that other mechanical configurations for various video eyewear devices 100 are possible.

In the FIG. 9 implementation, headset computer 100 includes generally a frame 1000, a strap 1002, a back section 1004, speaker 1006, cantilever or arm 1008, and microdisplay subassembly 1010. On one side of the device 100 opposite the cantilever arm 1008 is a peripheral port 1020. The peripheral port 1020 provides corresponding connections to one or more peripheral devices, so a user can removably attach various accessories to the device 100. As an example port 1020 provides a mechanical and electrical accessory mount such as a hot shoe. Wiring carries electrical signals from port 1020 through, for example, the back portion 1004 to circuitry disposed therein. Hot shoe 1020 can operate much like the hot shoe on a camera, automatically providing connections to power the accessory and carry signals to and from the rest of device 100. Various types of accessories can thus be used with port 1020 to provide the hand movements, head movements, and or vocal inputs to the system, such as but not limited to microphones, positional, orientation and other previously described sensors, cameras, and the like.

FIG. 1 was a view of the headset computer 100 as worn on the head of a user where an accessory 1060 has been placed in the hot shoe port 1020. This accessory 1060 is a self-contained camera (or other motion sensor) assembly. The camera 1060 can include both audio and video sensing, recording, and light emission capabilities in a package similar to a “bullet cam”. It can be connected to the remaining components in device 100 via built in wiring in the back section 1004 (as in the case of the speaker previously described) or can be wirelessly connected via a Bluetooth™ or WiFi™ connection. The camera 1060 may not necessarily be a video camera, but may also detect infrared, ultraviolet, or other wavelengths. The camera 1060 can also include a user adjustable auxiliary light source. With the light source, the camera 1060 can also be used as a light source as desired without activating the camera portion.

The camera, motion tracking and audio inputs to the device 100 are interpreted as user commands in various ways to control operation of the local processor, the microdisplay, or the external host.

Head movement tracking and/or vocal commands can also be provided by the user 1050 to manipulate the settings of camera 1060. For example, a user vocal command, such as “zoom” or “pan”, can be recognized by the local processor and cause the camera 1060 to zoom in or telephoto out.

FIG. 10 is a block diagram showing more detail of the device 100, optional host 200 and the data that travels between them. The device 100 receives audio signals input via the microphone, hand movements or body gestures via positional and orientation sensors, the camera or optical sensor(s), and head movement inputs via the head tracking circuitry such 3 axis to 9 axis degree of freedom orientation sensors. These are translated by software in a processor local to the device 100 into commands. These commands may then be interpreted by a processor internal to the device 100 to control aspects of the presentation of information on the microdisplay or other objects such as a peripheral or remote vehicle. The commands may also be sent over the Bluetooth or other wireless interface 150 to the host 200. The host 200 then interprets these translated commands in accordance with its own operating system/application software to perform various functions, and or returns information to the device 100. In one example, the device 100 and/or host 200 maintain a 3-D virtual space into what graphic objects are stored.

Among the commands that can be carried out on the local processor and/or the remote host 200 is one to select a field of view 300 within the virtual display. Thus, it should be understood that a very large format virtual display area might be associated with operating system or application software running on the device 100 or on the host 200. However, only a portion of that large virtual display area within the field of view is returned to and actually displayed by the remote control display device 120 as selected by the voice, hand gestures, or head motion commands.

FIG. 11 is a simplified high level block diagram of a non-limiting example embodiment of the headset computer device 100. The system electronics can be placed on or in the frame in an appropriate location (such as back section 1004) and include an Open Media Application Platform (OMAP) as the local processor 4110, a power/audio companion chip 4102, a display driver 4104, a head tracker circuit board 4106, and wireless LAN/Bluetooth interface 4108. Also located in the housing is a power source, such as a lithium ion battery 4200.

The device 100 may also include an eye pod assembly 4000 that includes the aforementioned microdisplay 4010 (e.g. the microdisplay 1010 and boom 1008 of FIG. 2A), and one or more microphones 4020. One or more speakers 4030 are positioned in the HMD housing earpiece near the user's ear (item 1006 in FIG. 9). The head tracker circuitry 4106 may include circuits to determine head movements and gestures detected by sensors in the device 100, such as lateral movements along and rotation gestures around the X, Y and Z axes using Hall effect sensors, MIM diodes, accelerometers, gyros and/or transducers or other sensors as mentioned above.

Device system 100 may also receive inputs from external input devices such as a wireless mouse, track ball, or keyboard that may be wirelessly connected through the Bluetooth interface 4108.

Software in the WLAN/BT front end 4108, the OMAP 4100 and/or host 200 may be used to interpret hand gestures detected by the camera or other sensors. A camera board 4060 may optionally provide video input, as well.

The OMAP processor 4100 may include a central processing unit, and on-chip memory such as Random Access Memory (RAM) that may include non volatile memory and/or Read Only Memory (ROM). The OMAP may be a Texas Instruments model OMAP 3530 processor or newer version sold by Texas Instruments, Inc. and using a multimedia processor. The OMAP 4100 may typically execute an embedded system such as operating a particular version of MicroSoft Windows®. The OMAP 4100 is generally a more powerful, and more power consuming processor than the WLAN/BT interface 4108.

In this example, a TPS 65950 power/audio companion chip, also available from Texas Instruments, provides audio, USB, keypad control and battery charging functions to the system.

The WLAN/BT interface 4108 may be a model LBEE 1W8 NEC-interface circuit, a Bluetooth circuit such as available from CSR, Ltd. of Cambridge, United Kingdom or other radio module with similar or greater capabilities.

The display driver may be a model KCD-A 910 display driver available from Kopin Corporation of Westborough, Mass.

The microdisplay 4010, also available from Kopin, can include models CyberDisplay 230K, WQVGA, VGA, WVGA, SVGA or other manufactures' acceptable microdisplays.

An NCS module 4400 takes raw microphone signal data as input, and outputs audio data with background noise removed. It produces an audio signal to the audio companion chip 4102 and from there to the OMAP processor 4100. Voice recognition is performed in software on the OMAP processor 4100, using the cleaned up microphone signals as fed in by the NCS 4400.

The teachings of all patents, published applications and references cited herein are incorporated by reference in their entirety.

While this disclosure has described several example embodiments, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the scope of the invention encompassed by the appended claims. 

What is claimed is:
 1. A method performed by one or more processors disposed within a headset computer device, comprising: operating a light source located within the headset computer according to voice, head motion, and/or hand gesture input(s) detected by sensors located within the headset computer, the light source emitting light away from the headset computer device and into an external environment to be synthetically visualized; operating a camera located within the headset computer according to voice, head motion, and/or hand gesture input(s) detected by sensors located within the headset computer; the light source and camera operating at a non-visible wavelength; displaying image data captured by the camera on a microdisplay disposed within the headset computer; determining a field of view and scale factor from the voice, head motion and/or hand gesture inputs; and selecting an image portion to be presented on the microdisplay from the image data depending on the field of view and scale factor.
 2. The method of claim 1 wherein the light source and camera operate at an infrared (IR) or ultraviolet (UV) wavelength.
 3. The method of claim 1 additionally comprising: receiving voice, head motion and/or hand gesture inputs and in response thereto, controlling an operating wavelength for the light source and the camera.
 4. The method of claim 1 additionally comprising: receiving voice, head motion and/or hand gesture inputs and in response thereto, controlling an intensity of emissions from the light source.
 5. The method of claim 1 additionally comprising: receiving voice, head motion, and/or hand gesture inputs, and in response thereto, controlling a sensitivity of the camera.
 6. The method of claim 1 wherein the light source is a laser range finder, and the image data includes range information.
 7. The method of claim 1 further comprising: determining range information for at least two points; determining distance information between the two points from the range information; and displaying the distance information on the microdisplay.
 8. The method of claim 1 wherein the range information is determined for multiple points and further comprising: determining volume information for a space defined by the multiple points; and displaying the volume information on the microdisplay.
 9. A method performed by one or more processors disposed within a headset computer device, comprising: receiving image data from a remote vehicle; receiving voice, head motion, and/or hand gesture input(s) from sensors located within the headset computer; determining a control command from the voice, head motion, and/or hand gesture inputs; and sending the control command to the remote vehicle, wherein the control command controls a camera located on the remote vehicle.
 10. The method of claim 9 additionally comprising: in response to sending the control command, receiving image data from the camera located on the remote vehicle; presenting the image data on a microdisplay disposed within the headset computer.
 11. The method of claim 9 wherein the control command is a first control command and wherein the method further comprises determining a second control command from the voice, head motion, and/or hand gesture inputs, wherein the second control command controls operation of the remote vehicle.
 12. The method of claim 11 wherein the control command is a first control command and wherein a voice input generates a second control command to control a path of the remote vehicle.
 13. The method of claim 12 wherein a head motion input generates a third control command to control the camera on the remote vehicle.
 14. The method of claim 9 wherein the control command controls position, attitude and/or direction of the camera.
 15. The method of claim 14 wherein the control command is a first control command and wherein the method further comprises: receiving an input from a handheld controller; and determining a command to control a path of the remote vehicle from the handheld controller input.
 16. The method of claim 15 wherein the voice input generates a hover command to cause the remote vehicle and camera to remain fixed in position. 